NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Pre-trained Large Language Models Use Fourier Features to Compute Addition

Zhou, Tianyi; Fu, Deqing; Sharan, Vatsal; Jia, Robin (December 2024, The Thirty-eighth Annual Conference on Neural Information Processing Systems)

Full Text Available
Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression

Fu, Deqing; Chen, Tian-Qi; Jia, Robin; Sharan, Vatsal (December 2024, The Thirty-eighth Annual Conference on Neural Information Processing Systems)

Full Text Available
When Parts Are Greater Than Sums: Individual LLM Components Can Outperform Full Models

https://doi.org/10.18653/v1/2024.emnlp-main.574

Chang, Ting-Yun; Thomason, Jesse; Jia, Robin (January 2024, Association for Computational Linguistics)

Full Text Available
Robust encodings: a framework for combating adversarial typos

https://doi.org/10.18653/v1/2020.acl-main.245

Jones, Erik; Jia, Robin; Raghunathan, Aditi; Liang, Percy (July 2020, Transactions of the Association for Computational Linguistics)

Despite excellent performance on many tasks, NLP systems are easily fooled by small adversarial perturbations of inputs. Existing procedures to defend against such perturbations are either (i) heuristic in nature and susceptible to stronger attacks or (ii) provide guaranteed robustness to worst-case attacks, but are incompatible with state-of-the-art models like BERT. In this work, we introduce robust encodings (RobEn): a simple framework that confers guaranteed robustness, without making compromises on model architecture. The core component of RobEn is an encoding function, which maps sentences to a smaller, discrete space of encodings. Systems using these encodings as a bottleneck confer guaranteed robustness with standard training, and the same encodings can be used across multiple tasks. We identify two desiderata to construct robust encoding functions: perturbations of a sentence should map to a small set of encodings (stability), and models using encodings should still perform well (fidelity). We instantiate RobEn to defend against a large family of adversarial typos. Across six tasks from GLUE, our instantiation of RobEn paired with BERT achieves an average robust accuracy of 71.3% against all adversarial typos in the family considered, while previous work using a typo-corrector achieves only 35.3% accuracy against a simple greedy attack.
more » « less
Full Text Available
Robust encodings: a framework for combating adversarial typos. Transactions of the Association for Computational Linguistics.

Jones, Erik; Jia, Robin; Raghunathan, Aditi' Liang (May 2020, arXiv:2005.01229 ACL 2020)

Despite excellent performance on many tasks, NLP systems are easily fooled by small adversarial perturbations of inputs. Existing procedures to defend against such perturbations are either (i) heuristic in nature and susceptible to stronger attacks or (ii) provide guaranteed robustness to worst-case attacks, but are incompatible with state-of-the-art models like BERT. In this work, we introduce robust encodings (RobEn): a simple framework that confers guaranteed robustness, without making compromises on model architecture. The core component of RobEn is an encoding function, which maps sentences to a smaller, discrete space of encodings. Systems using these encodings as a bottleneck confer guaranteed robustness with standard training, and the same encodings can be used across multiple tasks. We identify two desiderata to construct robust encoding functions: perturbations of a sentence should map to a small set of encodings (stability), and models using encodings should still perform well (fidelity). We instantiate RobEn to defend against a large family of adversarial typos. Across six tasks from GLUE, our instantiation of RobEn paired with BERT achieves an average robust accuracy of 71.3% against all adversarial typos in the family considered, while previous work using a typo-corrector achieves only 35.3% accuracy against a simple greedy attack.
more » « less
Full Text Available
Evaluation Examples are not Equally Informative: How should that change NLP Leaderboards?

https://doi.org/10.18653/v1/2021.acl-long.346

Rodriguez, Pedro; Barrow, Joe; Hoyle, Alexander Miserlis; Lalor, John P.; Jia, Robin; Boyd-Graber, Jordan (January 2021, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers))

Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simplicity can mask nuances in evaluation items (examples) and subjects (NLP models). Rather than replace leaderboards, we advocate a re-imagining so that they better highlight if and where progress is made. Building on educational testing, we create a Bayesian leaderboard model where latent subject skill and latent item difficulty predict correct responses. Using this model, we analyze the ranking reliability of leaderboards. Afterwards, we show the model can guide what to annotate, identify annotation errors, detect overfitting, and identify informative examples. We conclude with recommendations for future benchmark tasks.
more » « less
Full Text Available

Search for: All records